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Abstract 

We calculate the entropic part of partition function of a bubble embedded in a double stranded 
DNA (dsDNA) by considering the total weights of possible configurations of a system of two single 
stranded DNA (ssDNA) of given length which start from a point along the contour of dsDNA and 
reunite at a position vector r measured from the first point and the distribution function of the 
position vector r which separates the two zipper forks of the bubble in dsDNA. For the distribution 
function of position vector r we use the distribution of the end-to-end vector r of strands of given 
length of dsDNA found from the wormlike chain model. We show that when the chains forming 
the bubble are assumed to be Gaussian the so called loop closure exponent c is 3 and when we 
made correction by including self avoidence in each chain the value of c becames 3.2. 

PACS numbers: 87.14.gk,87.15.-v,87.15.ad 



1 



DNA is a set of two entangled polymers connected by hydrogen bonds between comple- 
mentary bases and base-stacking between nearest neighbour pairs of base pairs and wound 
around each other to make a helix Though a very long DNA behaves like a flexible 
polymer in a solution, at smaller length scales (which may be of the order of hundreds of 
base-pairs.) it exhibits considerable stiffness. The stiffness of a polymer chain is measured by 
the persistent length l p , the length scale at which the tangent vectors to the polymer curves 
are decorrelated [2j. On scale smaller than l p , bending energy dominates and the chain is 
relatively stiff with little or no bending. The l p for the double-stranded DNA (dsDNA) 
is experimentally known to be about 500A which is equivalent to 145 bases long along its 
contour [3] . In comparison, the persistent length for single stranded DNA (ssDNA) depends 
sensitively on the ionic strength of the solvent and can be as short as 10 A ^]. 

In physiological solvent conditions the average value of the interactions for a base-pair 



that stabilize the dsDNA structure is of the order of few knT (thermal energy) |5j and 
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thermal fluctuations can lead to local and transitory unzipping of the double-strands p, |7j ■ 
The cooperative opening of a sequence of consecutive base pairs leads to formation of local 
denaturation zones (bubbles). A DNA bubble consists of flexible single-stranded DNA and 
its size fluctuate by zipping and unzipping of base-pairs at the two zipper forks (points Pi 
and P2 in Fig. 1(b)) where the bubble connects the intact double-strands. As the persistent 
length l s of the ssDNA is several times smaller than that of dsDNA the two chains fluctuate 
relatively freely and make entropic contributions to free energy of the bubble. The existence 
of bubbles in dsDNA represents the classic competition between energy and entropy; the 
intact dsDNA regions are dominated by the interaction energy due to hydrogen bonding of 
pairs and base stacking, bubbles by the entropy gain on disruption of base-pairs. The average 
size of a bubble depends on the sequence of base-pairs, temperature and ionic strength and 
varies from few broken base-pairs at room temperature to few hundreds open base pairs close 
to melting temperature T m [8,9]. Below T m , once formed a bubble is intermittant feature 
and eventually zip close again. This DNA breathing can be probed on the single molecule 
level in real time by fluorosence method [7|. Above T m individual bubbles continuously 
increase in size and merge with vicinal bubbles until complete denaturation 10] . 

In the study of thermal denaturation of dsDNA [10,11,12] as well as in the study of static 
and dynamic properties of an intermittant bubble [13,14] one needs to know its free energy. 
The partition function of a bubble of length If, has been approximated by the number of 
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configurations of walks of 21^ length returnin g fo r the first time to the origin 15[ which in 
the limit — > oo assumes the following form 16j 



where // is a non-universal geometric factor while c, so called loop closure exponent, is a 
universal quantity. We refer the model leading to (1) as closed walks model. The factor a 
is used as an adjustable parameter to fit the denaturation curves of DNA and its value has 
been found to depend on the value of exponent c 3]. For example for c=1.75 the value of a 
that fits the experimental data is equal to 1.26 x 1CT 5 whereas for c=2.15, its value is found 
to be ten times larger (i.e. a = 1.26 x 10 -4 ) . Though a is called cooperativity parameter, its 
use is essentially to correct the configurations of the bubble which have been overestimated 
by the closed walks model. 

For closed random walks embedded in three dimensional space (3-d) the exponent c has 
been found to be 1.5. The inclusion of the excluded volume interactions within the bubble 
gives c=1.76 [l^]. When the excluded volume interactions between a bubble and the rest 



of the chain is taken, then c ~ 2.11 



Q. 
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18]. The nature of the melting transition depends 



on the value of c [10|- For c < 1 there is no transition, for 1 < c < 2, the transition is 
continuous while for c > 2, the transition is first order. 

The closed walks model which lead to Eq.(l) with values of c given above for the partition 
function of a bubble, however, ignores the fact that the bubble is embedded in a dsDNA 
which imposes constraints on the distribution function of vector r connecting the two ends 
(zipper forks) of the bubble. In this Letter we show that when this constraint is taken into 
account the value of c becomes greater than 2 even when the bubble is assumed to be formed 
by two non-interacting Gaussian chains. 

In Fig. 1(a) we show a segment of dsDNA in which points Pi and P2 are separated by 
distance 4 = nbo along the contour of the chain. Here n is the number of base-pairs between 
P\ and P2 and 60 — 3.4A,the average distance between successive base-pairs in the Watson- 
Crick model of dsDNA [1]. In space the points Pi and P2 are separated by vector r. In 
Fig. 1(b) we show a bubble which is formed by unzipping of all n base-pairs between P x 
and P2. The points Pi and P2 represent the two zipper-forks of the bubble. Each of the 
two chains C\ and C2 of the bubble is of length Iq = na$ where ao — 6 A. To calculate 
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the entropic part of partition function of the bubble of length lb formed by unzipping of n 
base-pairs of dsDNA we use the relation 

Z n {l h ) = [ drZ n (r,l Q )p(r,l b ) (2) 



where Z n (r,l ) are the total weights of the possible configurations of a system of two 
ssDNA each of length Iq which start at origin and reunite through hydrogen bonds and 
stacking interactions at a point defined by position vector r and p(r, Z&) is the normalized 
probability density of position vector r separating the two zipper forks of the bubble in 
dsDNA. As we show below the factor p(r, lb) which has been ignored in the closed walks 
model plays crucial role in determining the total number of configurations of a bubble. 

If we assume each chain forming the bubble to be Gaussian , its distribution for the 



end-to-end vector for l » l s can be written as 
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/4tt \~ 3 / 2 /-3r 2 \ 
Pn(r,lo) = [ Y lol s ) j (3) 

where 2l s is taken to be equal to Kuhn statistical segment length. All distances here and 
below are expressed in unit of &o and therefore made dimensionless. The total weights of the 
possible configurations of two random walks starting from point Pi (origin) and reuniting 
at a distance r at point P 2 are therefore 

/47r \~ 3 /— 3r 2 \ 
^(r,i„)=( T W,j ^{ W: ) (4) 

When one substitutes Eq.(4) in Eq.(2) and chooses p(r, lb)=l one gets Z n {l h ) oc l/lb° with 
c = 1.5 in agreement with the result reported in literature for closed random walks. Note 
that choosing p(r,lb)=l amounts to assuming that the vector r separating the zipper forks 
of the bubble of length lb in a dsDNA can have values between zero and infinity with equal 
probability which is obviously incorrect as it ignores the fact that the bubble is embedded in 
a dsDNA and its ends separation is constrained by the dsDNA strands. As shown in Fig. 1(b) 
the zipper forks Pi of the bubble on its left and the fork P2 on its right are connected to long 
strands of dsDNA. The fluctuational motion of these forks in space (not along the dsDNA 
chain which changes the value of lb) will therefore be controlled by these strands. Here we 
are considering dsDNA of contour length 00 , h/L — > and the bubble located away 
from the two ends of dsDNA . For a dsDNA which contour length is of the order of l p or less 
the bubble formation may change the conformational behaviour of the chain [26,27], but for 



oo such change is expected to be negligible as any change in their position amounts to 
moving or rotating the dsDNA segments attached to these points on either or both sides 
which may cost huge energy. It therefore seems reasonable to assume that the distribution 
of distance r separating points P\ and P2 in Fig. 1(a) remains unchanged after the formation 
of the bubble and p(r, i&) can be approximated by the probability density of finding points 
Pi and P2 at separation r = r(s) — r(s)' where = s — s' is the distance between Pi and P2 
along the contour of dsDNA. 

In order to understand the effect of possible approximation to p(r, lb) it may be useful 
to consider some limiting cases : In the case of dsDNA being a rigid rod of infinite length, 
p(r, l b ) can be given as p(r, lb) = S(r — /;,)/47r/ fe 2 as positions of Pi and P 2 remain unchanged 
due to formation of the bubble. For this case the partition function Z n (lb) reduces to 
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Zn(h) = ,3 (5) 



L b 



where 



K l = (ra) and fI G = e 
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In another limit of dsDNA being a freely joined phantom chain, p(r, l b ) can be found from 
the end-to-end distribution function of a Gaussian chain of length h » l p - Thus [3] 

( 1 \ (^l b l p y 3/2 f-3r 2 \ 
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where Kuhn statistical segment length is equal to 2l p . Substituting this in Eq.(2) we get 

K / 3 \ 3 / 1 \ 3 /2 

ZM = T K2= WJ (200^ + 00^ J (7) 

In both limits we find c=3. 

A dsDNA being a semi-flexible polymer, its conformational properties can be found from 
a wormlike chain model in which the polymer is represented by a differential space curve 



r(s) of length L parametrized to arc length 20|- The model is specified by the Hamiltonian, 
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where t(s) = is the unit tangent vector to the curve r(s), s measures the position 
along the contour, (3 = {ksT)^ 1 and ksTl p = k specify the stiffness of the chain. The 



inextensibility of the chain is expressed by the local constraint |t(s)| = l. The distribution 
function p s (r, l p ) of end-to-end vector r of a segment of contour length 4 can be found from 
the average of £(l,n)5(r — jQ lb i(s)ds) over all chain conformations. Here £(1, n) represents 
the boundary condition that ensures smooth variation of tangent vectors at the two ends of 
the segment in dsDNA. In an ensemble of segment of length If, in a dsDNA which in solvents 
forms a fractal structure, we may relax the boundary condition and approximate p s (r, lb) by 
the end-to-end distribution function of a chain of length If, with a free end. 

The distribution function p s (r, If) has been found by Monte Carlo simulation [21,22] and 
analytically using approximate schemes [23,24,25]. For h/lp < l,p s (r, lb) exhibits sharp peak 



at r ~ If, whereas for h/l P > 1 the peak broadens and shi 



h/lp > 1 a simple analytical expression has been found 23j which can be written as 



ts to smaller values of r/7&. For 



Ps(r ' h) = Nc { 1 - f ) exp {tt p (T^m) (9) 

where N c is the normalization constant and its value is found by the requirement 

POO 

4vr / p(rj b )r 2 dr = 1 (10) 
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Eq.(9) reproduces quite accurately the simulation results for l b /l p > 1 [21]. 

If we use p s (r, lb) given by Eq.(9) for p(r, lb) and substitute into Eq.(2) we get for lb/l P > 1 

Zn(k) = ^ (ID 

where 

K(1 , 1 4 + 12( 7a )' 1 + 15( 7 a)~ 2 / 3 x 3 

a = lk/l p and 7 = 1 + -^l 

The value of K(l b ) depends on the value of lb/l p - For l b /l p ^> 1, K 3 (l b ) reduces to 

3 1 \ 3 /1\ 3/2 

' ' ! (13) 
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which agrees with the value given in (7). We note that while the exponent c remains 3 the 
prefactor K depends on the form chosen for p(r, lb). 

The inclusion of excluded volume interactions will decrease the number of configurations 
found from the Gaussian chains forming the bubble. To see this we consider the effect 
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of self-avoiding interactions within chains C\ and C2. The number of self-avoiding walks 
(SAW S ) which start at origin and arrive at a point deSned by the positron vector r is given 

as [16] 

P„(r, io) ~ ^it^gir/Q (14) 

where /i is the connectivity constant, g(x) is a scaling function, 7 is the entropic and v is 
the metric exponent. Though the function g(x) is not exactly known for SAWs, it has been 
approximated as g(x) ~ x^e' XxS where A > ,5 = and (ft can be expressed in terms of 
known exponents [15,28]. For the random walks = and v — 1/2. As the bubble is formed 
by two self-avoiding walks each of length Iq with common end points, the total weights of 
all configurations of the bubble of which the two end points are separated by distance r are 

Z n {rM ~ » 2h ll {r ~ l ~* v) H{r/l v Q ) (15) 

where 

■H{r/ll) ^ exp[-2\{r/l^' 

The integral in Eq.(2) is no longer Gaussian, but can be evaluated using the steepest 
descent method. Here we, however, take simple view and consider the limiting case of the 
bubble being embedded in an infinitely long rod shaped dsDNA. As for this case p(r, lb) = 
5(r— lb) / 4:irlb 2 , the integral is easily evaluated giving c = 2— 2u+Qu. Using approximate SAW 
exponents, 7 = 1.158, v = 0.588 [16] we find c = 3.2. The analysis given here ignores the 
excluded volume interactions between the two chains and between chains and the segments 
of dsDNA which inclusion will further increase the value of c. A more systematic treatment 
of the effect of excluded volume interactions on c will be given in a future publication. 

In conclusion, we suggest that the partition function of the bubble embedded in a dsDNA 
should be calculated using expression given by Eq.(2). There are two terms in this expression. 
First, the total weights of possible configurations of a system of two ssDNA of given lengths 
which start from a point along the contour of dsDNA and reunite at another point on dsDNA 
at a distance r. The points of origin and reuniting of two ssDNA are the points where bubble 
connects with the intact dsDNA strands and are called zipper forks. Second, the probability 
density of finding the zipper forks of a bubble of given length at distance r apart. In the 
closed walks models which have been used to estimate the partition function of the bubble 
the second term has been neglected. We calculate the total weights of possible configurations 



of two chains forming the bubble by treating them to be noninteracting Gaussian chains. We 
also consider the case when they are represented by the self-avoiding walks model. For the 
probability density of finding the zipper forks at distance r apart in long dsDNA we suggest 
that it can be approximated by the distribution function of end-to-end vector r calculated 
for dsDNA using the wormlike chain model. For the Gaussian chain model the exponent c 
is found to be 3 and when self-avoidence in each chain were considered its value increased 
to 3.2. Inclusion of the excluded volume interaction between chains and between segments 
of dsDNA will further increase the value of c. 
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FIG. 1: Fig. 1(a) Shows a segment of dsDNA. The contour length between points Pi and P2 which 
are separated by distance r is 3-d space is equal to i& = nbo where n is the number of base-pairs 
and 6o=3. 4A 1(b) Shows bubble formed by unzipping of all n base-pairs between points Pi and 
P2. The length of single stranded DNA C\ and C2 is equal to Iq = nao where ao — 6 A. The points 
Pi and P2 represent the zipper-forks of the bubble. A zipper fork is junction point between dsDNA 
and the bubble. 
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